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ABSTRACT _ _ _ _ _^ _ _ ^ ^ 

Eighty undergraduate students participated in an 
investigation of the effects on acquisition of i^ versus 
30-second delay of reinforcement under either knonn or unknown 
re sponse contingency conditions^ The-'students in the feur 
experimental conditions were - required to depress three buttons in 
proper senquence. Following a correct response, the proper sequence 
systematically changed^ The task was to discover the rule which 
determined how the sequence changed. Criterion was_ three consecutive 
correct responses^ A two-way analysis of variance demonstrated 
significant main effects fbr.bbth the temporal reinforcement delay 
and response 'Ccntingehcy _cbnditiphr and for the interaction effect. 
Immediate reinforcement facilitated acquisition better than 
reinforcement that was delayed 30-seconds. Acquisition was also-^_ 
better facilitated when the subject had knowledge of the response 
contingency. The interaction effect demonstrated that the.^ difference 
between acquisition rates for immediate and delayed reinforcement 
increased when the subject was unaware of the response_ contingency* 
The results affirmed the findings cf research supporting the use of 
immediate, and contingent reinforcement for acquisition. 
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Delay of Reinforcement and 
Knowledge of Response Cbntirigenciies 
in Human Learning 

The consistent demohstratibri in research Uising non-humans that a learn- 
ing decrement results as a function of increasing the rieinf orceraiBht delay 
interval is much less stable in investigations using studients (Renriier, 1964) i 
While Sassenrath, Yon^^e , and Shrable (1968) found that itninediate feedback 
facilitated learning better than delayed feedback, in the same year two of 
the same investigators (Sassenrath & Yonge, 1968) reported delay groups 
learned better than immediate feedback groups. The effect of deiayinp; rein- 
forcement in classroom-related learning has varied as a function of the 
nature of the task (Bourne, 1957: Sax, 1960; Wittrock, 1964; Kulhavy, 1971) 
and task meaningf ulness and difficulty (Angell, 1949; Hockman and Lipsitt, 
196D). Differing results have also emeirged by varying the length of the 
intertrial interval (Markbwitz & Renner, 1966; Frayer^ 1971; Piper^ 1971; 
& Sturges, 1972). 

The intertrial interval, or temporal span from the termination of one 
response to the initiation of the next response, is apparently the critical 
parameter in delay or reinforcement studies. The period of time following 
a correct response but prior to the reirif brcemeht is the prie-reiriforcemerit 
interval. The period of timiB fbllbwing the rieinf orcement but prior to 
thiB next response is the post-reinforcement interval. Boersma (1966) con- 
tended that extending the pre or post-reinforceraent interval is not sufficient 
to inhibit learning, but that interpolated, competing responses are. Brackbiii 
Bravos, and Starr (1962) have suggested a person maintains a memory trace of 
a correct response and that competing responses daring the pre-reinf orceraent 
interval interfere. Markbwitz and Renner (1966) have dissagreed, contending 
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that a person does riot mediate until realizing a correct resporise was made; 
i.e., during the pbs t~reinfbrcement interval. Ihterpblatdd responses during 
the pre-reinf brcemerit interval wbuld , there fbre, not be competing. 

Markowitz and Rerinisr (1966) i>binted but that many studios (e.g., I3rackhill 
and Kappy, 1962) included knowledge of results at the end of the intortrial 
interval, re-establishing the Ibst itiembry trace resulting from the initial 
delay. They contended that a reinforcement delay wbUld decrease learning if 
the later knowledge of results were omitted. 

The present investigation was designed to determine the effects bf de- 
layed reinforcement with and without re-establishing the response cbritirigehcy 
through post-reinforcement knowledge of results. It was hypothesized that 
1.) with post-reinforcement knowledge of results the inhibiting effect frbm 
delayed reinforcement would diminish because the subject would not need tb 
mediate during the delay interval, 2.) without post-reinforcement knowledge 
of results mediation would be necessary because the interi>oia ted responses 
would have competing properties causing the reinforcement delay to inhibit 
learning> and 3.) delayed reinforcement and the exclusion of post-reinforcement 
knowledge of results would produce inhibitory effects which were additive. 
Thus, there were four conditions. Reinforcement was either immediate (I) 
b,r delayed (D) for 3d-sec and the response contingency was either known (K) 
by giving post-reinforcement knowledge of results or unknown (U) by withholding 
post-reinforcement knowledge of results. 
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Method 

Subjects 



Subjects were 80 undergraduate student volunteers enrolled at Kansas 
State University. The students were randomly assigned to four groups; 
There were 3 7 males and 43 females. 
Apparatus 

The control panels for the first author and the student were 6.4 mm by 
23.0 mm metal chassis mounted on a 30.5 mm by 52.0 mm wooden base. A 1.0 m by 
1.5 m cardboard partition was placed upright between the control panels. 
These materials were painted flat black. The student's panel had throe neon 
lights mounted 2.6 mm apart horizontally and 1.9 mm from the top of the panet. 
Red aceta'ie was placed in front of each' light. One normally-on push button 
was mounted directly under each light and 1.9 mm from the bottom of the panel. 

neon light was centered vertically and mounted 1.3 mm from the left edge 
of the panel. A yellow plastic cap was placed over this light. The author's 
control panel was identical to the student's except there was no yellow 
light. Two mercury switches were secured to the base and to the right of the 
control panel to activate the timers. There were two Hunter automatic inter- 
val timers with silent relays to the left of the control panel. A cassette 
recorder with 15-min tape was plactid beside the timers . 

The cassette recorder and tape were used to produce an audible click each 
15-sec to signal the start of each trial. Depression of the three push but- 
tons by the student was monitored by the author with the throe red panel lights 
Each mercury switch controlled one of the timers. Tho tiinors wore both set 
for either a 0-sec or 30-sec delay interval. The timers tnrnod on the yellow 
reinforcement light on the student's panel. Turning the mercury switch off 



Delayed Rbihf orceinent 
5 

turned out the riBirif drcemerit light. Thie author's three push buttons, when 
sequentially depressed, turniBd on the student's threie red lights in cor- 
responding seiquiBnciB. The second tiiner was necessary if the student emitted 
a second correct response during the existing 30-sec delay interval. 
Procedure 

When seated at the apparatus, the student was told that the task Was to 
depress ail three buttons in the proper sequence, depressing each button only 
once. The student was told that the yellow light would come on if the se- 
quence was correct. Also, the student was informed that the correct sequence 
V70uld change after each correct response, and that the ultimate task was to 
determine the rule which identified how the sequence changod. Students in 
the delayed reinforcement (D) groups were told that when the yellow light 
came brii it would not necessarily mean that the last response ^iven was the 
correct one. In the contingency known (K) groups where post-reinforcement 
knowledge of results was given, students were told that after the yellow 
light came on, the three red lights would come on one at a time in the same 
sequence the student had used in correctly responding. Thus^ all students 
were told the yellow light signified a correct response. Students in the D 
groups were told the correct trial may have been any one of the responses 
since the last reinforcetiient . Students in the K groups were aided by a visual 
display of the correct response sequence. 

The i5-sec intertrial interval tape made an audible click to start each 
trial. On a correct trial the author threw the timer switch to activate the 
reinforcement process. If a second correct reispcnse foUowed in the next 
two trials, the second delay timer was activated. For the KT studeriLs the 
reirifbrcemerit light came on immediately. To identify the response contingency 
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for the student, the author depressed his three buttons duplicating the stu- 
dent's correct response. This sequence appeared on the student's control 
panel. The author then turned off the timer switch, turning out the yellow 
reinforcement light. The cassette tape again clicked and a new trial began. 
After a reinforcement following a correct response, however, the sequence 
changed. The starting point moved back one button to the left. For example, 
if the first correct sequence was 2, 3, 1, the next correct sequence follow- 
ing a reinforced response became 3^ 1, 2. This was followed by 1^ 2, 3^ etc. 
For the students in the KD group^ the procedure was the same, except that the 
timer delayed the onset of the reinforcement light for 30-siBc. After the 
light came bh, the author again dupllicated the studerit^s correct response. 
The students in the UI and UD groups did not receive response confirmation 
from the red lights. Thus, they were not infonned of the response contin- 
gency. To suiffliiarize , the reinforcement light was activated for all students 
under (either I or D conditions, but only the students in the K conditions 
received knowledgie of results via the three red lights. The criterion was 
three consecutive correct responses, or 60 trials. Maximum time for testing 
was 15-min. The data were analyzed by using a ccmpieteiy randomized two-way 
analysis of variance. 

Results 

One student in the UI condition and two students in the UD conditions 
did not complete the experiment. Their data were discarded and alternate 
students were selected. 

The data analyzed were total number of trials to criterion. The means 
for the KI and UI conditions were 12.25 and 14.45^ respectively. The means 
for the KD and UD conditions were 32.60 and 55.85^ respectively. Standard 
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deviations ranged from 5.99 to 9.19. Table 1 saramarizes the two-way analysis 
of variance which resulted in significance for delay of reinforcement effects; 
L (li76) = 134.94, £ <.dOl; response contingency effects, £ (1,76), £ < .001; 
and their interaction F (1,76) = 15.68, £ < .001. 

Insert Table 1 about here 



The effects of temporal delay of reirif brcieraerit varied across levels of 
knowledge of the response cdritirigericy to produce the interaction shown in 
Figure 1; 



Insert Figure 1 about here 



Discussion 

This investigation was supportive of the use of immediate and contingent 
reinforcement for a simple learning task. Delaying reinforcement clearly de- 
creased learning. Removing post-reinforcement knowledge of results made inter- 
polated responses during the delay interval cbmpietihg^ and further reduced 
learning as demonstrated by the interaction effect. Many researchers (Saltzman, 
karifer, and Greenspoon, 1955; Noble and Alcock^ 1958; Brackbill and Kappy, 1962; 
Smith, 1963; Brackbill, Wagner, and Wilson, 1964; Sturges, 1970) have found 
non-significant results between immediate and delayed reinforcement. These 
studies, however i either excluded interpolated responses and/or included post- 
reirif or cement knowledge of results. In the present inves tigation^ the UD group 
did poorest. Because the response contingency was not identified via post- 
reinf orceinent kriowledgiB of riBSults, a valid delay interval was created. 
Necessary mediation during this delay was impeded due to competing responses. 
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Students in the KD group learned better than studierits in the UD group because 
mediation during the delay interval was not needed. The iritieraction showed the 
effects of reinforcement delay and an unknown respbrisie cbritirigericy to be additive. 
Because of the iimisediate temporal proximity of the correct response and rein- 
forcement, students in the KI arid UI groups learned best. Since students in 
both of these groups were instructed that reinf orceinent would immediately 
follow a correct response, identification of the response contingency should not 
have provided additional information. 

Many studies have used meaningful verbal material rather than simple 
learning tasks, and have extended the delay interval to as much as 7-days 
(Bilodeau and Bilodeau, 1958). Future delay of reinforcement research which 
varied task difficulty under conditions of known and unknown response contingen- 
cies would seem worthwhile. Optimum delay intervals should vary as a function 
of the nature of the learning task involved. 

Sturges (19 72) found that with equivalent pre-reihf orcement intervals ^ 
persons learned less the longer the posr- . r inf brcement interval became, A 
person would not necessarily be expected to repeat a given behavior immediately 
after it was reinforced; i.e., a pbst-reirif brcement interval of time would 
elapse. A study bf the effects bf increasing the pbst-reiriforcement interval 
with and without pbst-reinf brcement knowledge bf results cbuld provide 
additional, qualifying data. 
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Table 1 

Two-way Analysis of Variance 
Summary Table 

Source of variation df^ SS MS F 

Total 79 34,257.40 446.29 

Reinforcement (I~D) 1 19,065.33 19,065.33 134.94* 

Contingency (K-U) 1 3,238.53 3,238.53 22.92* 

I-D X K-U 1 2,215.49 2,215.49 15.68* 

Error 76 10,738.05 141.29 

* £ <.001, 
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Figure Caption 

Figure 1. interaction effect between reinforcement delay interval and 
response contingency condition. 
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